On Hats and other Covers 

(Extended Summary) 
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Abstract. We study a game puzzle that has enjoyed recent popularity among mathematicians, computer 
scientist, coding theorists and even the mass press. In the game, n players are fitted with randomly assigned 
colored hats. Individual players can see their teammates' hat colors, but not their own. Based on this 
information, and without any further communication, each player must attempt to guess his hat color, or 
pass. The team wins if there is at least one correct guess, and no incorrect ones. The goal is to devise 
guessing strategies that maximize the team winning probability. We show that for the case of two hat 
colors, and for any value of n, playing strategies are equivalent to binary covering codes of radius one. 
This link, in particular with Hamming codes, had been observed for values of n of the form 2 m — 1. We 
extend the analysis to games with hats of q colors, q > 2, where 1-coverings are not sufficient to characterize 
the best strategies. Instead, we introduce the more appropriate notion of a strong covering, and show 
efficient constructions of these coverings, which achieve winning probabilities approaching unity. Finally, we 
briefly discuss results on variants of the problem, including arbitrary input distributions, randomized playing 
strategies, and symmetric strategies. 
Keywords: coding theory, covering codes, games 

1 Introduction 

The following game puzzle has recently received attention from mathematicians, computer scien- 
tists, coding theorists, and even from the mass press [Q: A team of n players enters a game room, 
and each player is fitted with a hat, which is either red or green. A player can see the other players' 
hat colors, but not his own. Each player is then asked to make a declaration, which must be one 
of the statements "my hat is red," "my hat is green," or "I pass." All the players must declare 
simultaneously, and no communication is allowed between them during the game. They are per- 
mitted, however, to hold a strategy coordination meeting before the game starts. The team wins if 
at least one player declares his hat color correctly, and no player declares an incorrect color. The 
distribution of hat color combinations is assumed, for the moment, to be random and uniform, i.e., 
all 2 n combinations are equally likely. The goal of the team is to devise a strategy that maximizes 
the winning probability. 

A winning probability of 50% is guaranteed by a trivial strategy in which a designated player 
declares "red," and the rest "pass." The puzzle is popularly posed for n = 3. In that case, 
the following strategy yields a 75% winning probability, which turns out to be optimal: Upon 
observing the hats of the teammates, if a player sees two identical colors, he declares the opposite 
color, otherwise he passes. Clearly, the team wins whenever the color configuration consists of two 
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hats of the same color and one of the opposite color, and it loses when the three hats are of the 
same color. Moreover, the following property holds: 



In winning configurations, one player declares the correct color, and the rest pass; in 
losing configurations all the players declare a wrong color. 

For any value of n, a strategy satisfying this property is said to be perfect. In addition to being 
perfect, the described team strategy for n = 3 is also symmetric, in that the individual strategies 
of the players are identical. In general, strategies need not be symmetric; players have identities, 
and are allowed to use distinct individual strategies. The symmetric constraint for general n is 
discussed in Section |IJ 

We will be interested in the behavior of the winning probability for the best strategies as a 
function of n, and in its asymptotic behavior as n — > oo. The rest of the summary is organized as 
follows: in Section |2] we formalize the notion of a deterministic strategy for the two-color (binary) 
hats game, and we show that, for any n, such a strategy is equivalent to a binary covering code 
of length n and radius one. Consequently, the best strategies are equivalent to the best (smallest) 
covering codes, and their winning probabilities approach one when n — > oo. The relation between 
the binary hats game and codes had been previously observed, in particular in relation to Hamming 
codes; see, e.g. 011], wnere the game is presented as a "prisoner's puzzle," and optimal strategies 
are defined for values of n for which Hamming codes exist. In Section |*3 we extend the game to q 
possible hat colors, q > 2, while leaving the playing rules unchanged. We show that g-ary covering 
codes are not sufficient to describe the best strategies when q > 2, and define the notion of a strong 
covering, which obeys much stricter combinatorial constraints. We present efficient constructions of 
strong coverings of asymptotically vanishing relative size, and whose associated playing strategies, 
therefore, win with probability approaching one. Finally, in Section 0J we briefly discuss results on 
other variants of the game, which will be covered in detail in the full version of the paper. 

2 Optimal Strategies for the Binary Game 

Let the symbols and 1 represent the colors in the binary hats game, let ./V = {1,2, . . . , n}, and 
let V n = {0, 1}™ denote the n-dimensional binary space, n > 2. 1 A deterministic n-player strategy 
is a vector J 7 = (fi, /2, . . . , f n ), where the function fi : V n -\ — ► {0, 1, pass}, i € N, encodes the 
instructions for player i, i.e., upon observing u G V n -\, player i declares /«(u). Clearly, given a 
hats configuration vg^, and a strategy T , it is uniquely determined whether J- wins on v or not. 
For a given strategy J-, let Wjr C V n denote the set of winning configurations for T . Translating 
the rules of the hats game to this notation, we have 



A covering code of radius r, or r- covering of V n (in the Hamming metric) is a set C C V n such 
that for all v G V n there is a vector c G C satisfying d(v, c) < r, where d denotes the Hamming 
distance. A comprehensive treatment of covering codes, including an extensive bibliography, can be 
found in 0. It follows from the characterization of in Q that if (w%, W2, ■ ■ ■ , w n ) is a winning 

x It is readily verified that for n £ {1, 2}, the best achievable winning probability is the trivial 50%. 
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configuration then, for some coordinate i G N, we have (wi, W2, ■ ■ ■ , wl, • • • , w n ) g" Wp, where x 
denotes the Boolean complement of x. Therefore, for any deterministic strategy V n \ is a 
1-covering of V n , which we shall denote Cp. Conversely, let C be a 1-covering of V n , and define the 
following playing strategy Tc- 

For i G N, and u = (w±, W2, • • • , Wi-i, Wi + \, . . . , w n ) G V n —i, if exactly one value 
x G {0, 1} satisfies (wi, W2, . . . , itfj-i, a;, tUj+i, . . . , w n ) €" C, then fi(u) = x; otherwise, 
/i(u) = pass. 

Proposition 1 The set of winning configurations of Tq isV n \C. 

Proof. Assume w G V n \ C . Since C is a 1-covering, there exists an index i G N such that flipping 
the ith coordinate of w results in a vector in C . Therefore, player i will declare the right color on w. 
Players j 7^ i have at least one way to make the vector fall outside of C (namely, x = Wj). Hence, 
they either guess right or they pass, and w is a winning configuration. The converse direction is 
straightforward. □ 

It follows from the foregoing discussion that there exists a one-to-one correspondence between 
deterministic strategies and 1-covering codes. Assuming a uniform distribution on the input con- 
figurations, the strategy Tc corresponding to a 1-covering C has losing probability 

P L = \C\/\V n \=2- n \C\. (2) 

In the sequel, we omit the subscripts J- and C when the correspondence between strategies and 
coverings is clear from the context. 

The strategy for n = 3 described in Section ^ corresponds to C = {000, 111}, i.e., the [3,1] 
repetition code, which is also the Hamming code of order 2. This code is invariant under the action 
of the symmetric group S3 on its coordinates, thus resulting in a symmetric strategy. 

Covering codes obey the sphere- covering bound, which, for radius one, takes the form 

\C\ >2 n /(n + l). (3) 

As is well known, the mth order binary Hamming code 7i m , m> 1, has length n = 2 m — 1, and it 
satisfies the sphere-covering bound with equality, namely, it is a perfect code. This corresponds to 
a perfect strategy for the hats game, as defined in Section^ It is also well known that any perfect 
binary 1-covering must have the same parameters (n, |C|,d) as a Hamming code 2_. Notice that 
Hamming codes with n > 3 are not invariant under S n , and the associated strategies are therefore 
not symmetric. For values of n for which Hamming codes exist, the strategies attain the optimal 
losing probability Pl = (n + 1) . 

When n is not a Hamming length, we can construct a 1-covering of length n by taking the 
largest integ er nf such tlictt ft > nf — 2 m — 1, and defining C — TYm x Vn-ri (a trivial direct 
sum construction 2 ). In the corresponding strategy, players 1 through n' play the n'-player 
strategy ^Fu-mi an< ^ ignore teammates n' + 1 through n. The latter ignore everything, and they 
always pass. The losing probability for the overall strategy is Pl = {p! + 1) , which satisfies 
(n + 1) _1 < Pl < 2{n + 1) . The factor of 2 on the right hand side of the inequality chain is 
related to the density of the covering, defined as \i = \C\{n + l)2~ ra . For perfect codes, we have 
fj, = 1, while for the worst values of n in the direct sum construction (those of the form n = 2 m — 2), 
H gets arbitrarily close to 2. The latter is true for any linear construction, due to the fact that 
linear codes only come in sizes that are powers of two. It was shown in [3], however, that one can 
construct sequences of non-linear 1-coverings with limsup,^^^ = 1. For the derived strategies, 
the main term of the losing probability Pl approaches, asymptotically, the ideal (n + l) -1 . 
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3 The g-ary Game 



3.1 Strong coverings 

We now consider a game where the hat colors are drawn from a g-ary alphabet Q = {0, 1, . . . , q — 1}, 
for an arbitrary integer q > 2. The playing rules remain the same as in the binary case. The trivial 
strategy where one player always chooses, say, the color 0, and the other players pass, yields a 
winning probability 1 — Pl = q~ . Observing how 1-coverings in the binary case led to strategies 
with winning probabilities approaching one, it might be initially tempting to derive strategies for the 
g-ary game from good g-ary 1-coverings. This approach, however, quickly leads to disappointment, 
even in the case of perfect codes. A closer analysis reveals that a perfect code will lead to situations 
where, with high probability, one player can guess what color his hat is not, while the other 
players pass. Therefore, strategies derived from g-ary 1-coverings approach a winning probability 
of (g — 1) _1 (which is of course satisfactory when q = 2, but leaves room for improvement otherwise). 

As in the binary case, we characterize a deterministic strategy T in terms of its set C C Q n of 
losing configurations, and its complement W = Q n \ C. 

Proposition 2 Let w = (w±, W2, • • • , w n ). If w G W , then there exists a coordinate i G N 
such that for all x G Q \ {w{}, we have (wi, u>2, • • • , Wi-i, x, Wi + \, . . . , w n ) G C . Conversely, 
any pair C, W of complementary sets satisfying this condition defines a valid strategy for the q-ary 
game. D 

We call a set C as characterized in Proposition [2] a strong covering of Q n . The following proposition 
presents an analog of the sphere-covering bound for strong coverings. 

Proposition 3 Let C be a strong covering of Q n . Then, 

\C\ > ^4 (4) 
n + q — 1 

Notice that when q = 2, the bound Q coincides with the usual sphere-covering bound, and, in 
fact, strong coverings coincide in that case with 1-coverings. A strong covering of Q n is said to be 
perfect if it satisfies (|1J with equality. 

Proposition 4 A strong covering C is perfect if and only if the strategy Tc is perfect. □ 
Proposition 5 There are no perfect strong coverings for q > 2 and n > 1. □ 

3.2 Winning the g-ary game with probability approaching 1 

Assume that Q is endowed with an abelian group law. Let H m be the parity check matrix of the mth 
order binary Hamming code, and let n = 2 m — 1. The rows of H m are interpreted as characteristic 
vectors of subsets of N, rather than vectors over the finite field F2. H m induces a well-defined map 
ip : Q n — > Q m , with <^(v) = H m v T , where the inner product of a row [hn, hi2, ■ ■ ■ , hi n ] of H m and a 
vector v € Q n is defined as J2{j-.hij=l} v v W1 th addition taken over the abelian group Q. In analogy 
to the usual linear code terminology, we refer to <p(v) as the syndrome of v. Clearly, the kernel of 
ip is a subgroup of the n-fold direct sum Q n . Assume is the identity of Q, and define 

C = {v G Q n : y>(v) G {Q*) m } , Q* =Q\ {0}. 
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Proposition 6 C is a strong covering of Q n . 

Proof. Let w = (w\,W2, ■ ■ ■ ,w n ) G Q n \C. Then, </?(w) has some zero entries, and, thus, there is a 
column hj of H m such that hij = 1 if and only if v?(w)j = 0. Therefore, we have <£>(w) +ahj G (Q*) m 
for all a G Q* , and, hence, (u>i, u>2, • • • , Wj-±,x, Wj + i,w n ) G C for all x G Q \ {vjj}. The claim now 
follows from Proposition [2] and the definition of a strong covering. □ 

Notice that the "code" C is constructed as the union of those cosets of ker(( / 9) in Q n whose cor- 
responding syndromes have only non-zero components. 2 There are (q — l) m such cosets, out of a 
total of q m . It follows that the playing strategy corresponding to C has a losing probability 

p L = ( g -V" 1 = (1 _ g -l)log 2 (n+l) = (n + ^log^l-g- 1 ) ™ _ (5) 

When q is allowed to grow, while still maintaining q <C n, the expression © for Pjr, exhibits an 
asymptotic behavior of the form 0{n~ l l qlog2 ). Both the code C and the associated playing strategy 
can be efficiently computed. 



3.3 A generalized construction 

Strong coverings with a faster convergence of the relative size (and hence the losing probability) to 
zero can be obtained by generalizing the construction of Section 13.21 Define 

C = {v € Q n : <p(y) G (Q*) m or w%(v)) < (3m} , < < 1. (6) 

Here, (3 is a parameter to be optimized, and wt(-) denotes the Hamming weight . The construction 
of Section I5~2l corresponds to @ = 0. At first sight, using a value of /? > seems counter-intuitive, 
as it appears to increase the size of C. However, the inclusion of low-weight syndromes in the 
definition of C allows us to use a subset of the columns of H m , since the high- weight columns are 
no longer needed to match low-weight syndromes produced by vectors in Q n \ C. This can reduce 
the value of n for a given order m, which in turn might improve the vanishing rate of \C\/q n . 

Proposition 7 Let q > 5 and let j q be the nonzero root of the equation ^2(7) = 7log2(<7 — 1), 
where /^(O is the binary entropy function. Set (3 = 1 — 7 g , and redefine tp in @) as y(v) = H m v T , 
where H m is formed from the subset of columns hj o/H m such that < wt(hj) < j q m. Then, C 
is a strong covering of Q n satisfying \C\q~ n = 0(n~ c ), with 

□ 

For values of q < 11, it is advantageous to choose (3 = and revert to the construction of 
Section 13.21 However, for < g < u, the dominant term in the exponent c of Q is of the form 
l/(elog(<7 — 1)), which compares favorably with l/(glog2) in the first construction. 

Noga Alon [Hj, using a random-coding argument, showed the existence of strong coverings with 

P L < iq - 1)l0gn+1 + (l-q-r- (8) 
n 

For fixed q, the main term of this upper bound behaves like (q — 1) log n/n, compared to (q — l)/n 
for the sphere-covering-like lower bound (j3J. Finding an efficient construction that attains the 
bound (JSJ), and closing the gap between @ and © remain open problems. 

2 This runs somewhat contrary to the usual construction of linear codes. In particular, for q — 2, the 1-covering 
produced by the construction is the coset of the Hamming code whose syndrome is all-ones. 
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4 Variations 



We briefly describe some results on variations of the game described in Section ^ These results 
will be covered in detail in the full version of the paper. 

Non-uniform input distributions. The playing strategies described in previous sections 
are easily modified to accommodate non-uniform distributions of the hat color configuration. We 
assume the distribution used is unknown to the players. However, all they need to do is choose, 
in their strategy session, a random translation to be applied to C at game time. The translation 
cancels the effect of any non-uniformity in the input distribution, without affecting the covering 
properties of the original code. 

Randomized playing strategies. In this setting, players choose their declarations at random, 
conditioned on their observations. It is proved that from the perspective of each player, the overall 
probability of team success is a linear function of the parameters of the random strategy, which 
are in turn subject to linear constraints. It follows that an optimal winning probability is attained 
at a vertex of the convex polyhedron defined by these constraints, i.e., for a deterministic playing 
strategy. It follows that randomization does not help the team. 

Symmetric strategies. Players are indistinguishable, and are forced to use identical strategies, 
based on counts of observed colors. It is proved that for the binary game, the best symmetric 
strategy achieves a winning probability approaching 2/3 as n —* oo. This implies that the best 
unconstrained strategies for large values of n must be asymmetric. In fact, n = 3 is the largest 
value of n for which a symmetric strategy is optimal. 

Zero-information strategies. Players make a random declaration, without access to any 
information. A winning probability of 1/4 is asymptotically attainable and optimal. 

Acknowledgments. Thanks to Joe Buhler for introducing us to the problem and for many 
stimulating discussions, and to Noga Alon for his communication 6 . 
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